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AMENDMEN T TO THE CLAIMS 

1. (Currently Amended) A method of segmenting words into component parts, the method 
comprising: 

determining a mutual information scores for a pair of graphoneme units.! 
comprising a first graphoneme unit and a second graphoneme unit, using 
the probability of the first graphoneme unit appearing immediately after 
the second g ra phoneme unit, the unigram probability of the first 
gra phoneme unit and the unigram probability of the second graphoneme 
unit, each graphoneme unit comprising at least one letter in the spelling of 
a word; 

using the mutual information scores to combine the first and second graphoneme 

units into a larger graphoneme unit; and 
segmenting words into component parts to form a sequence of graphonemes based 

on the larger graphoneme unit , 

2. (Currently Amended) The method of claim 1 wherein combining graphoneme unit s 
comprises combining the letters of each graphoneme unit to produce a sequence of letters for the 
larger graphoneme unit and combining fee-phones of each graphoneme unit . to produce a 
sequence of phones for the larger graphoneme unit. 

3. (Original) The method of claim 1 further comprising using the segmented words to generate 
a model. 

4. (Original) The method of claim 3 wherein the model describes the probability of a 
graphoneme unit given a context within a word. 

5. (Original) The method of claim 4 further comprising using the model to determine a 
pronunciation of a word given the spelling of the word. 
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6. (Currently Amended) The method of claim 1 wherein using the mutual information 
scores comprises summing at least two mutual information scores determined for a single larger 
graphoneme unit to form a strength, 

7. (Currently Amended) A computer-readable medium having computer-executable 
instructions for performing steps comprising: 

determining mutual information scores for pairs of graphoneme units found in a 
set of words, each graphoneme unit comprising at least one lette r and each 
mutual information score for a pair of graphoneme units based on the 
probabi lity, of one graphoneme u nit of the pair of graphoneme units 
appealing, i mmediately after the other graphoneme unit of the pair of 
grapho neme units, and the unigram probabilities of each g ra phoneme unit 
in the pair of graphoneme units ; 

combining the graphoneme units of one pair of graphonome units to form a new 
graphoneme unit based on the mutual information scores; and 

identifying a set of graphoneme units for a word based in part on the new 
graphoneme unit. 

8. (Original) The computer-readable medium of claim 7 wherein combining the graphoneme 
units comprises combining the letters of the graphoneme units to form a sequence of letters for 
the new graphoneme unit. 

9. (Original) The computer-readable medium of claim 8 wherein combining the graphoneme 
units further comprises combining the phones of the graphoneme units to form a sequence of 
phones for the new gaphoneme unit. 
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10, (Original) The computer-readable medium of claim 7 further comprising identifying a set of 
graphonemes for each word in a dictionary. 

1L (Original) The computer-readable medium of claim 10 further comprising using the sets of 
graphonemes identified for the words in the dictionary to train a model 

12. (Original) The computer-readable medium of claim 11 wherein the model describes the 
probability of a graphoneme unit appearing in a word, 

13. (Original) The computer-readable medium of claim 12 wherein the probability is based on at 
least one other graphoneme unit in the word. 

14. (Original) The computer-readable medium of claim 1 1 further comprising using the model to 
determine a pronunciation for a word given the spelling of the word. 

15. (Original) The computer-readable medium of claim 7 wherein combining graphoneme units 
based on the mutual information score comprises summing at least two mutual information 
scores associated with a new graphoneme unit. 

16. (Currently Amended) A method of segmenting a word into syllables, the method 
comprising: 

segmenting a set of words into phonetic syllables using mutual information scores 
wherein using a m utual infomiation score comprises computing a mutual 
information scor e for two phones bas ed on the probability of two phones 
appearing next to each other in the set of words and the unjgram 
probabilities of each of the two phones in the set of words ; 

using the segmented set of words to train a syllable n-gram model; and 
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using the syllable n-gram model to segment a phonetic representation of a word 
into syllables via forced alignment. 

17. (Currently Amended) A method of segmenting a word into morphemes, the method 
comprising: 

segmenting a set of words into morphemes using mutual information scores 
wherein using mutual information scores comprises comp uting a mutual 
information s c ore for two letters based o n the probabi lity of the two letters 
appearing next to each other in the set of words and the unigram 
probabilities of each of the two letters in the set of words ; 
using the segmented set of words to train a morpheme n-gram model; and 
using the morpheme n-gram model to segment a word into morphemes via forced 
alignment. 



